智能论文笔记

Easily Accessible Text-to-Image Generation Amplifies Demographic Stereotypes at Large Scale

Federico Bianchi , Pratyusha Kalluri , Esin Durmus , Faisal Ladhak , Myra Cheng , Debora Nozza , Tatsunori Hashimoto , Dan Jurafsky , James Zou , Aylin Caliskan

分类：自然语言处理 | 计算机视觉

2022-11-07

Machine learning models are now able to convert user-written text descriptions into naturalistic images. These models are available to anyone online and are being used to generate millions of images a day. We investigate these models and find that they amplify dangerous and complex stereotypes. Moreover, we find that the amplified stereotypes are difficult to predict and not easily mitigated by users or model owners. The extent to which these image-generation models perpetuate and amplify stereotypes and their mass deployment is cause for serious concern.

translated by 谷歌翻译

On the Opportunities and Risks of Foundation Models

Rishi Bommasani , Drew A. Hudson , Ehsan Adeli , Russ Altman , Simran Arora , Sydney von Arx , Michael S. Bernstein , Jeannette Bohg , Antoine Bosselut , Emma Brunskill

分类：机器学习 | 人工智能

2021-08-16

AI正在经历范式转变，随着模型的兴起（例如Bert，Dall-E，GPT-3），这些模型经过大规模的数据训练，并且可以适应广泛的下游任务。我们称这些模型基础模型来强调其至关重要但不完整的特征。该报告提供了基础模型的机会和风险的详尽说明，包括其功能（例如语言，愿景，机器人技术，推理，人类互动）和技术原则（例如，模型架构，培训程序，数据，系统，安全，安全性，评估，理论）对其应用（例如法律，医疗保健，教育）和社会影响（例如不平等，滥用，经济和环境影响，法律和道德考虑）。尽管基础模型基于标准的深度学习和转移学习，但它们的规模导致了新的新兴能力，以及它们在许多任务中的有效性都激发了同质化。同质化提供了强大的杠杆作用，但要求谨慎，因为基础模型的缺陷均由下游的所有适应模型继承。尽管即将广泛地部署基础模型，但我们目前对它们的工作方式，失败以及由于其新兴属性的影响而缺乏清晰的了解。为了解决这些问题，我们认为基础模型的许多批判性研究都需要与他们的基本社会技术性质相称。

translated by 谷歌翻译

The Values Encoded in Machine Learning Research

Abeba Birhane , Pratyusha Kalluri , Dallas Card , William Agnew , Ravit Dotan , Michelle Bao

分类：机器学习 | 人工智能

2021-06-29

机器学习目前对世界产生了巨大的影响，越来越多地影响机构实践并影响了社区。因此，至关重要的是，我们质疑该领域的模糊概念是价值中性或普遍有益的，并研究该领域正在发展的特定价值。在本文中，我们首先介绍了一种研究文档中编码的值的方法和注释方案，例如研究论文。采用该方案，我们分析了100个高度引用的机器学习论文，该论文在Premier机器学习会议，ICML和Neurips上发表。我们注释论文的关键特征，这些特征揭示了其价值观：他们选择项目的理由，这些项目的归因于他们提升的项目，对潜在的负面后果的考虑以及机构的隶属关系和资金来源。我们发现，很少有论文证明其项目如何与社会需求联系起来（15 \％），而讨论负潜力（1 \％）的讨论更少。通过逐行的内容分析，我们确定了59个在ML研究中得到提升的值，其中，我们发现论文最常根据绩效，概括，定量证据，效率，基于过去的绩效，定量证据，效率来证明和评估自己的合理性和评估工作和新颖。我们提供了广泛的文本证据，并在这些价值观的定义和操作中确定了关键主题。值得注意的是，我们发现系统的文本证据表明，这些最高价值是通过假设和含义来定义和应用的，通常支持权力的集中化。在本文中，我们发现这些高度引用的论文与科技公司和精英大学之间的关系越来越紧密。

translated by 谷歌翻译

Characterizing Intrinsic Compositionality in Transformers with Tree Projections

Shikhar Murty , Pratyusha Sharma , Jacob Andreas , Christopher D. Manning

分类：自然语言处理

2022-11-02

When trained on language data, do transformers learn some arbitrary computation that utilizes the full capacity of the architecture or do they learn a simpler, tree-like computation, hypothesized to underlie compositional meaning systems like human languages? There is an apparent tension between compositional accounts of human language understanding, which are based on a restricted bottom-up computational process, and the enormous success of neural models like transformers, which can route information arbitrarily between different parts of their input. One possibility is that these models, while extremely flexible in principle, in practice learn to interpret language hierarchically, ultimately building sentence representations close to those predictable by a bottom-up, tree-structured model. To evaluate this possibility, we describe an unsupervised and parameter-free method to \emph{functionally project} the behavior of any transformer into the space of tree-structured networks. Given an input sentence, we produce a binary tree that approximates the transformer's representation-building process and a score that captures how "tree-like" the transformer's behavior is on the input. While calculation of this score does not require training any additional models, it provably upper-bounds the fit between a transformer and any tree-structured approximation. Using this method, we show that transformers for three different tasks become more tree-like over the course of training, in some cases unsupervisedly recovering the same trees as supervised parsers. These trees, in turn, are predictive of model behavior, with more tree-like models generalizing better on tests of compositional generalization.

translated by 谷歌翻译

PARSE challenge 2022: Pulmonary Arteries Segmentation using Swin U-Net Transformer(Swin UNETR) and U-Net

Akansh Maurya , Kunal Dashrath Patil , Rohan Padhy , Kalluri Ramakrishna , Ganapathy Krishnamurthi

分类：计算机视觉

2022-08-20

在这项工作中，我们介绍了我们提出的方法，该方法是使用SWIN UNETR和基于U-NET的深神经网络体系结构从CT扫描中分割肺动脉的方法。六个型号，基于SWIN UNETR的三个型号以及基于3D U-NET的三个模型，使用加权平均值来制作最终的分割掩码。我们的团队通过这种方法获得了84.36％的多级骰子得分。我们的工作代码可在以下链接上提供：https：//github.com/akansh12/parse2022。这项工作是Miccai Parse 2022挑战的一部分。

translated by 谷歌翻译

Cluster-to-adapt: Few Shot Domain Adaptation for Semantic Segmentation across Disjoint Labels

Tarun Kalluri , Manmohan Chandraker

分类：计算机视觉 | 机器学习

2022-08-04

跨数据集的语义细分的域适应性，由相同类别组成，已经获得了一些最近的成功。但是，更一般的情况是源和目标数据集对应于非重叠标签空间时。例如，分割数据集中的类别根据环境或应用程序的类型发生了很大变化，但共享许多有价值的语义关系。基于特征对齐或差异最小化的现有方法不会考虑此类类别的转移。在这项工作中，我们提出了群集到适应（C2A），这是一种基于计算有效的聚类方法，用于跨分割数据集的域适应性，这些方法完全不同但可能相关类别。我们表明，在变换的特征空间中强制执行的这种聚类目标可以自动选择跨源和目标域的类别，这些类别可以对齐以改善目标性能，同时防止对无关类别的负转移。我们通过实验对室外的挑战性问题进行了实验，以少量拍摄和零拍设置来证明室内适应性的挑战性问题，在所有情况下，性能对现有方法和基准的绩效持续改善。

translated by 谷歌翻译

MemSAC: Memory Augmented Sample Consistency for Large Scale Domain Adaptation

Tarun Kalluri , Astuti Sharma , Manmohan Chandraker

分类：计算机视觉 | 人工智能 | 机器学习

2022-07-25

实用的现实世界数据集具有丰富的类别，为无监督的领域适应带来了新的挑战，例如小型阶层歧视性，仅依靠域不变性的现有方法不能很好地处理。在这项工作中，我们提出了MEMSAC，该MEMSAC利用了跨源和目标域的样本级别相似性，以实现判别性转移，以及扩展到大量类别的体系结构。为此，我们首先引入一种内存增强方法，以在标记的源和未标记的目标域实例之间有效提取成对的相似性关系，该实例适用于处理任意数量的类。接下来，我们建议和理论上证明对比损失的新型变体，以促进阶层内跨域样本之间的局部一致性，同时在类别之间执行分离，从而保留从源到目标的歧视性转移。我们验证了MEMSAC的优势，比以前的最先进的最先进的转移任务有了显着改进。我们还提供了深入的分析和对MEMSAC有效性的见解。

translated by 谷歌翻译